Scaling breakdown in flow fluctuations on complex networks 
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We propose a model of random diffusion to investigate flow fluctuations in complex networks. We derive an 
analytical law showing that the dependence of fluctuations with the mean traffic in a network is ruled by the 
delicate interplay of three factors, respectively of dynamical, topological and statistical nature. In particular, we 
demonstrate that the existence of a power-law scaling characterizing the flow fluctuations at every node in the 
network can not be claimed for. We show the validity of this scaling breakdown under quite general topological 
and dynamical situations by means of different traffic algorithms and by analyzing real data. 

PACS numbers: 89.75.-k, 89.20.Hh, 89.75.Da 



Communication networks 01 are nowadays the subject of 
intense research as modern society increasingly depends on 
them. On the one hand, the first studies have dealt with the 
architecture of these systems, showing that the systems' topo- 
logical features ljj,|2j,|3|] are at the root of the critical behavior 
of several dynamical processes taking place on top of them 
(H 01 ■ On the other hand, models for traffic and information 
flow on complex networks have been recently investigated as 
a way to improve our understanding on key issues such as the 
scalability, robustness, performance and dynamics of techno- 
logical networks U 0]. In particular, much effort has been 
invested in finding what are the conditions for an efficient per- 
formance of communication networks, the latter being mea- 
sured as the ability of the system to avoid congestion and re- 
duce transit times J5[0,0,0|. Nevertheless, large communica- 
tion networks such as the Internet usually avoid the regime in 
which congestion arises and therefore the dynamics of packets 
is not driven by congestion processes. Instead, the fluctuations 
in traffic flow constitute the main factor affecting the dynam- 
ics of these communication systems. 

The relationship between the fluctuations a and the average 
flux (/) in traffic dynamics on complex networks is a contro- 
versial issue that has received a lot of attention very recently 

[TUl . The authors of Refs. JH [loll claimed the existence 
of the relation a ~ (f) a , with real communication networks 
belonging to one of two universality classes, the first one char- 
acterized by an exponent value a = 1/2, the second one by 
a = 1. The authors of Hill questioned the existence of the 
two universality classes. They numerically showed that there 
is a wide spectrum of possible values for a, depending on pa- 
rameters such as the persistence of packets in the network, 
the duration of the time window during which statistics are 
recorded, and the rate of service at the nodes' queues ll 111 . 

In this Letter, we propose a model for traffic in complex net- 
works, the Random Diffusion (RD) model, that is amenable 
to analytical solution. The model predicts the existence of a 
simple law that relates the fluctuations at a node i, ui to the 
average traffic flow /j, depending on the delicate balance of 



three quantities: (i) the variation in the number of packets in 
the network, (ii) the degree of the node i, and (iii) the length 
of the time window in which measures of traffic flow are per- 
formed. Notwithstanding its simplicity, the RD model is able 
to capture the essential ingredients determining the scaling of 
fluctuations empirically observed for traffic flow in real com- 
plex networks. More important, we also show that the hy- 
pothesis of a power-law scaling of flow fluctuations has to 
be abandoned under certain conditions. Results of numerical 
simulations of a traffic-aware model and analysis of real data 
of Internet flow confirm our theoretical findings. 

In the random diffusion (RD) model we represent packets 
of information as w random walkers traveling in a network 
made up of TV nodes and K links. Under the assumption that 
the packets are not interacting, it follows that the average num- 
ber of walkers Ai at a node i is given, in the stationary regime, 
byEB 
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Let us assume that the total observation time T is divided into 
time-windows of equal length. Each window is made of M 
time units. A window represents the minimal resolution for 
measurements of the flux in a node and its fluctuations, be- 
ing the first the result of accumulating the number of packets 
traveling through the node during the M time units. The av- 
erage number of packets (/j) processed by node i in a time 
window is measured, together with its standard deviation <7j. 
These are the two quantities monitored in Refs.(0, Ll ] for real 
systems and in the numerical simulations of network traffic 
models. The main interest is to investigate the dependence of 
fjj with (/,•). In particular, we want to verify whether a power- 
law relation ~ (/,)" holds, and what factors determine the 
exponent a. In the RD model we can consider two possible 
situations: either the number of packets in the network is con- 
stant over the whole period of time T, namely w = W, or 
it can vary from one time window to the other. In the latter 
situation, we assume that the probability F(w) of having w 
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walkers on the network in a window of length M is equally 
distributed in the range [W — 6, W + 6], i.e., 



F(w) 
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with 1 < S < W . To find an expression for the average 
number of packets (fi) flowing through a given node i, we 
first calculate the probability Pi (n) that, after M time steps, 
n packets have visited node i. 

In the case w = W, due to the fact that the packets are 
not interacting, the arrival of walkers at a node is a Poisson 
process. Therefore, after a period of M time units, the mean 
number of packets (the average flux) at a node i is (fi) = 
Xi(w)M, and the probability of having n packets reads 



Pi 
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with a = y/ \i(w)M = \J (fi)- Thus, the scaling exponent is 
a = 1/2. 

In the more general case in which the number w is dis- 
tributed as in Eq. (gj), the probablity Pi (n) is 
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Calculating first and second moments of Pj(n) one obtains 

kiWM 
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(fi) = 5>P,(n) 



(5) 



Finally, the standard deviation can be expressed as a function 

of (fi) as 



(fi)( l + (/i>„ 
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The above derivation provides an understanding of the ori- 
gins of Eq. (|2l), proposed in and shows that the relation 
between er; and (/j) depends on the concurrent effects of three 
factors, namely: (i) the noise 5 associated to the fluctuations 
in the number of packets in the network from time window to 
time window; (ii) the length M of the time window; and (Hi) 
the degree of the node ki (since (fi) depends on fc,). Con- 
sequently, real traffic rarely falls in either of the two limiting 
cases of Eq. (O, i.e., a ~ (/)" with a = 1/2 or 1. 

Expression (|7]l contains all the behaviors previously ob- 
served in Refs. ||9l and also predicts new dependencies 
that can be tested to be valid in more refined traffic models as 
well as in real data. In fact, if the three quantities S, M and ki 
are such that 
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FIG. 1: Flow fluctuation a as a function of (/) for the RD model 
with various parameter values. In panel (a), 8 = 10 3 and W = 10 4 . 
In panel (b), W has the same value while M has been fixed to 10. In 
both figures, points correspond to the solution of Eq. 10 for different 
values of ki (1 . . . 18). The total number of links is K = 33500. 
Dashed lines are guides to the eyes and correspond to a ~ {/)", 
with a = 1/2 (lower curves) and a — 1 (upper curves). See the text 
for further details. 



expression (0 reduces to a power-law scaling a ~ (f) a with 
exponent a = 1/2. On the contrary, whenever the ratio k ^£w 
is not negligible anymore, the exponent a differs from l/2 
and approaches 1 . In other words, it may well be the case in 
which, even for small values of the noise parameter S, a large 
value of M cancels out the effect of the ratio 4? being too 
small in Eq. (0. This behavior was already explored in 111 111 
by means of numerical simulations. However, the fact that 
the ratio in formula (HJ depends quadratically on 8 and only 
linearly on ki and M, has gone unnoticed. The RD model 
puts such dependence on solid theoretical grounds, and also 
reveals the role played by the other two parameters M and ki 
on the observed scaling. 



In Fig. Q] we plot the dependence of a with (/) in the RD 
model for several values of the parameters M and 6. Panel (a) 
corresponds to the case in which the ratio ^ = 10 _1 is fixed 
and the length of the time windows used to measure the flow 
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of packets through different nodes is varied. For each value of 
M, we have superimposed the results obtained for nodes with 
different connectivity values, ranging from fcj = 1 to fcj = 18. 
If one follows the arguments given in Jgt], a value of a = 1/2 
should be expected for this choice of 5/W . Instead, as shown 
in the figure, a ~ (/)= only for small values of M, while the 
scaling exponent approaches 1 as M is increased. This means 
that, whenever the temporal resolution in the measurements 
is not small enough and packets are counted and accumulated 
over long periods, a tends to 1. 

A novel striking feature revealed by law and not re- 
vealed in previous studies, is the dependence with the degree 
of the nodes. An example of the effects of node degrees is 
shown in Fig.QJ. It turns out that, for some values of M (e.g. 
M = 10 2 in the figure), the fluctuations at lowly connected 
nodes are characterized by an exponent a = 1/2, whereas for 
highly connected nodes the exponent turns out to be a = 1. 
Hence, there is not a single exponent characterizing the fluc- 
tuations at every node of the network, regardless of its con- 
nectivity. This is again a clear indication that a power-law be- 
haviour, a ~ (f) a , even with non-universal exponents rang- 
ing in [1/2, 1], is not the most general situation when char- 
acterizing the flow fluctuations for a whole network S IT ] . 
Admittedly, a is not constant for every possible choice of the 
parameters 5, W and M along the whole set of fcj values. 
This effect is particularly relevant for highly heterogeneous 
networks like the Internet, where degree classes span several 
decades. In these kinds of networks, one should therefore ex- 
pect different scaling laws depending on whether the packets 
are flowing through lowly or highly connected nodes. 

The influence of the noise level on a for a fixed time win- 
dow length (A/ = 10) is depicted in Fig.[TJ>. When 6 is small, 
so that the number of packets in the network from one time 
frame to the following does not change significantly, a = 1/2. 
On the contrary, when S is sufficiently large, the exponent is 1. 
This is more in consonance with the results in where the 
dependence with the noise level was addressed only for a low 
value of M, getting that as S increases a — > 1. On the other 
hand, we observe again that fixing M and varying S does not 
guarantee the existence of a unique exponent for the scaling of 
fluctuations in traffic flow, though in this case the dependence 
is smoother than that observed in Fig.ITJj. 

In the following we show that expression (O predicted 
by the RD model is indeed valid for more elaborated traf- 
fic models, and that the RD approximation captures the phe- 
nomenology of real communication systems. We report the 
results obtained on top of synthetic scale-free (SF) networks 
with N = 10 4 nodes and power-law degree distributions 
Pk ~ fc -7 , with an exponent 7 = 2.2 as the one empirically 
observed for the Internet at the autonomous system level J2l. 
However, we stress that since the topological properties of the 
underlying graph only enter into Eq. (0 through the degree of 
the nodes fc, and the total number of links in the network, K, 
the results hold for any graph with an arbitrary degree distri- 
bution pk as our own simulations using SF networks, random 
graphs and a real autonomous system map of the Internet J2l 




FIG. 2: (color online) Flow fluctuation a as a function of (/) 
from numerical simulations of the Internet traffic model (see text 
for details) on synthetic scale-free networks with N = 10 4 nodes, 
K — 37551 links, and degree exponent 7 = 2.2. Different pan- 
els correspond to different values of M, respectively M — 1,5 x 
10 2 , 35 x 10 3 , 10 5 . Color-coded values represent the logarithm of 
node degree. The continuous line is the curve y = a; ' 5 , while the 
dashed line is y ~ x. 



reveal. 

On the other hand, to mimic the way packets flow in real 
communication networks, we consider a dynamical model 
that is able to simulate Internet's most important dynamical 
characteristics @ Q]. The dynamics of the packets is simu- 
lated as follows. Each node represents a router with an in- 
finite size buffer. The delivery of packets is made following 
a First In First Out (FIFO) policy. At each time step, p new 
packets are introduced in the system with randomly chosen 
sources and destinations 014I1 . Packets routing is based on 
a traffic-aware scheme flU in which the path followed by 
a packet is that that minimizes the effective distance d\ e = 
hd t + (1 — h)ai, where di is the distance between node i and 
the packet destination, Cj is the number of packets in i's queue, 
and ft, is a tunable parameter that accounts for the degree of 
traffic awareness incorporated in the delivery algorithm 
It is worth recalling that h = 1 recovers a shortest-path de- 
livery protocol, mimicking most of the actual Internet routing 
mechanisms. 

Figure|2]shows a as a function of (/) obtained through ex- 
tensive numerical simulations of the traffic model with h = 1 
and p — 2. Different panels in the figure correspond to dif- 
ferent values of the time-window length M. The results indi- 
cate that the main responsible of the value of a (interpolating 
between the two extreme a = 1/2 and a = 1) is the inter- 
play between the node degree and the time resolution used to 
record the flux of packets, exactly as predicted by the scal- 
ing law <j7j obtained in the RD model. In fact, Fig. [SJ; cor- 
responds to the choice of parameters for which formula ^ 
holds for all values of fej, leading to a = 1/2. On the con- 
trary, when M is large enough and the other parameters are 
kept fixed as in Fig. relation ^ is not satisfied whatever 
the value of ki used, hence giving an exponent a = 1. Finally, 
the breakdown of the scaling law a ~ (f) a anticipated by the 
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of statistical nature. More importantly, the theoretical law re- 
veals that the previously claimed power-law scaling (with uni- 
versal or non-universal exponents) has to be abandoned. Our 
numerical results and the analysis of real data confirm that, 
even in the presence of correlations between packets, one can- 
not assume a single exponent to characterize the fluctuations 
of traffic for the whole network. Finally, we note that the 
scaling breakdown predicted here is amenable to experimen- 
tal confirmation by measuring the traffic flow in large com- 
munication networks so to capture the predicted (topological) 
effects of degree heterogeneity. 

We thank A. Arenas, De Felice, G. Drovandi and J. Duch 
for helpful comments and discussions on this work. Y. M. is 
supported by MEC through the Ramon y Cajal Program. This 
work has been partially supported by the Spanish DGICYT 
Projects FIS2006-12781-C02-01 and FIS2005-00337. 



FIG. 3: Flow fluctuation a as a function of (/} for the Abilene Inter- 
faces. The values of M used in each panel are: M — 5 (a), M = 30 
(b), M = 60 (c), and M = 720 (d). Time is in minutes. The value 
of a for each M is also reported. Averages are taken over one month 
of data corresponding to the period between January 1 1 to February 
1 1 of 2006. 



RD model is captured in Figs. [2p and c, where it is clearly 
revealed that there is not a unique exponent characterizing the 
flow of packets through every node of the network. Indeed, 
there is a crossover from a ~ (f) 1 ^ 2 for lowly connected 
nodes to a ~ (/} for the highly connected ones. We also note 
that a similar behavior is observed (figures not shown) when 
traffic-aware routings {h < 1) are taken into account. 

Finally, we have also analyzed the data corresponding to the 
traffic between routers of the Abilene backbone network 115 ] . 
As the data collected for the routers in the backbone corre- 
spond only to the flow between them, this backbone network 
can be viewed as an isolated communication system where the 
routers create, delivery and receive data packets. Therefore, 
the measures effectively correspond to a small network han- 
dling a large amount of traffic and whith all its nodes having 
a similar degree. For this reason, we are not able to observe 
here the dependence with the node degree. However, at vari- 
ance with the analysis performed in jl ill , we have varied the 
length of the time windows used to extract the flux and its de- 
viation 111611 . Once again, the results, depicted in Fig. [3] show 
that the exponent a is not universal and radically depends on 
M. Note that, although the lower bound of a = 0.706 > 1/2 
is determined by the minimal resolution (M = 5 minutes) 
of the raw data, further increasing M will recover the upper 
bound a = 1. 

In summary, in this paper we have derived a theoretical law 
for the dependence of fluctuations with the mean traffic in a 
network. Such a dependence is governed by three factors: one 
related to the dynamics, one related to the topology, and one 
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